Building a Transparent Batching Layer for Storm

نویسندگان

  • Matthias J. Sax
  • Malu Castellanos
چکیده

 Building a Transparent Batching Layer for Storm Matthias J. Sax, Malu Castellanos HP Laboratories HPL-2013-69 streaming data, distributed streaming system, batching, performance, optimization Storm is a distributed intra-node-parallel stream processing system built for very low latency processing. One major drawback of Storm is its relatively low throughput. In order to increase Storm's throughput, we designed a batching layer for Storm that is able to improve Storm's throughput significantly. In order to get a high user acceptance, we did not modify Storm but build the batching layer "on top" of it. The layer is transparent to the Storm system as well as to the user code, i.e., the user-defined functions. Thus, already developed Storm programs (so-called topologies) can benefit from our batching layer without modification. In this document, we describe the design of the batching layer and provide inside into some implementation details. External Posting Date: July 6, 2014 [Fulltext] Approved for External Publication Internal Posting Date: July 6, 2014 [Fulltext] Copyright 2014 Hewlett-Packard Development Company, L.P. Hewlett-Packard Laboratories 1501 Page Mill Road, Palo Alto (CA), USA Building a Transparent Batching Layer for Storm HP Labs Technical Report HPL-2013-69 Matthias J. Sax Databases and Information Systems Group Humboldt-Universität zu Berlin, Germany [email protected] www.dbis.informatik.hu-berlin.de Malu Castellanos Hewlett-Packard Laboratories Palo Alto (CA), USA [email protected] www.hpl.hp.com

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimization of Building Energy Consumption by Optimizing the Transparency Area of Outer Shell in Urban Block Scale: Case of Hemmatabad Worn-Out Area in Isfahan

Addressing the issue of energy in contemporary Iranian architecture is very important at all stages of planning, design, construction and operation, and the effort to coordinate and interact with the environment is always one of the foundations of sustainable construction. On the other hand, much of the citychr('39')s surface is composed of worn-out areas with high levels of energy consumption....

متن کامل

Modeling the waves induced by the January 2019 storm in the Caspian Sea in order to investigate the reasons of partial damage of western arm of Caspian Port breakwater

Countries which are benefited from the sea and marine resources are trying to provide proper infra-structures to use the advantages of the seas. Ports, undoubtedly, are the primary and the fundamental infrastructure of every country in the field of marine economics. Governments all over the world spend huge funds on designing, building, and maintenance of these structures. According to the reco...

متن کامل

Adaptive Coherence Batching for Trap-Based Memory Architectures

Both software-initiated and hardware-initiated prefetching have been used to accelerate shared-memory server performance. While software-initiated prefetching require instruction set and compiler support, hardware prefetching often require additional hardware structures or extra memory state. The coherence batching scheme proposed in this paper keeps the system completely binary transparent and...

متن کامل

The Split and Merge (SAM) Protocol for Interactive Video-on-Demand Systems

A true Video-on-Demand (VOD) system provides the ultimate flexibility in video services by allowing users to select any video programs, at any time, and to perform any VCR-like user interactions. To allow true VOD, one approach is to have a dedicated video stream for each customer. This is expensive, especially when multiple identical video streams are sent to multiple customers accessing the s...

متن کامل

Appling Metaheuristic Algorithms on a Two Stage Hybrid Flowshop Scheduling Problem with Serial Batching (RESEARCH NOTE)

In this paper the problem of serial batch scheduling in a two-stage hybrid flow shop environment with minimizing Makesapn is investigated. In serial batching it is assumed that jobs in a batch are processed serially, and their completion time is defined to be equal to the finishing time of the last job in the batch. The analysis and implementation of the prohibited transference of jobs among th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014